Mitigating the Position Bias of Transformer Models in Passage Re-ranking
نویسندگان
چکیده
Supervised machine learning models and their evaluation strongly depends on the quality of underlying dataset. When we search for a relevant piece information it may appear anywhere in given passage. However, observe bias position correct answer text two popular Question Answering datasets used passage re-ranking. The excessive favoring earlier positions inside passages is an unwanted artefact. This leads to three common Transformer-based re-ranking ignore parts unseen passages. More concerningly, as set taken from same biased distribution, overfitting that overestimate true effectiveness. In this work analyze datasets, contextualized representations, effect retrieval results. We propose debiasing method datasets. Our results show model trained position-biased dataset exhibits significant decrease effectiveness when evaluated debiased demonstrate by mitigating bias, are equally effective dataset, well more transfer-learning setting between differently
منابع مشابه
Experiments with LSA for Passage Re-Ranking in Question Answering
As in the previous QA@CLEF track, two separate groups at the University of Alicante participated this year using different approaches. This paper describes the work of Alicante 1 group. We have continued with the research line established in the past competition, where the main goal was to obtain a fully data-driven system based on machine learning techniques. Last year an XML framework was est...
متن کاملthe use of appropriate madm model for ranking the vendors of mci equipments using fuzzy approach
abstract nowadays, the science of decision making has been paid to more attention due to the complexity of the problems of suppliers selection. as known, one of the efficient tools in economic and human resources development is the extension of communication networks in developing countries. so, the proper selection of suppliers of tc equipments is of concern very much. in this study, a ...
15 صفحه اولRe-Ranking Models for Spoken Language Understanding
Spoken Language Understanding aims at mapping a natural language spoken sentence into a semantic representation. In the last decade two main approaches have been pursued: generative and discriminative models. The former is more robust to overfitting whereas the latter is more robust to many irrelevant features. Additionally, the way in which these approaches encode prior knowledge is very diffe...
متن کاملTopics in Mitigating Radar Bias
In this paper, we investigate two topics related to mitigating the effect of radar bias in ballistic missile tracking applications. We determine the absolute bias between two radars in polar coordinates when their relative bias is given in rectangular coordinates. Using this result, we then obtain the optimized steady-state filter to handle the random bias.
متن کاملHMM-based Passage Models for Document Classification and Ranking
We present an application of Hidden Markov Models to supervised document classification and ranking. We consider a family of models that take into account the fact that relevant documents may contain irrelevant passages; the originality of the model is that it does not explicitly segment documents but rather considers all possible segmentations in its final score. This model generalizes the mul...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-72113-8_16